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ABSTRACT 

About 3515 genomic and EST sequences from the public database were mined in-silico for microsatellite 
repeats containing di-, tri-, tetra- and Penta-nucleotide repeat motifs and 3334 primer pairs were designed. One hundred 
primers were synthesized and validated in Eleusine coracana species, and the polymorphic microsatellites were used for 
characterizing 32 local land races and commercial varieties collected from five different districts in Karnataka. The 
studies were focused on estimating genetic parameters of the markers, genetic diversity and population structure. 
Functional annotation and gene ontology was assigned using Uniprot database under the categories of molecular 
function, biological process and cellular component. The mean values of the number of alleles per locus, diversity index 
and polymorphic information, content were 4.6, 0.5 and 0.43 respectively. The population structure analysis performed 
under an admixture model with correlated allele frequencies using STRUCTURE, and the principal coordinate analysis 
indicated four subgroups within the population. While AMOVA indicated significant variation within and among 
populations, Hardy- Weinberg test indicated no significant deviations in majority of the markers. 
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INTRODUCTION 

Finger millet ( Eleusine coracana L.) commonly known as Ragi, is one of the extensively grown minor 
millets in the regions of India and Africa. It ranks sixth in production after wheat, rice, maize, sorghum and Bajra in 
India. It is considered as one of the most nutritious cereal due to its abundance of high calcium, iron, protein, fiber 
and other minerals. It is a rich source of non-available carbohydrates with low glycemic index, which is a boon for 
cardiovascular and diabetic patients. The antioxidant properties resulting from phytates, polyphenols and tannins 
makes it anti-cancer and anti-aging food crop (Chandra et al., 2016). Besides its nutritional and health benefitting 
properties, the superior adaptability traits of this crop make it as a storehouse of important genomic resources for 
crop improvement (Sood et al, 2016). Finger millet is an allopolyploid derived from ‘AA’ and ‘BB’ genomes with 
chromosome number 2n = 4x = 36. Though the crop originates from Africa, the Indian sub population is reported to 
be low in genetic/sequence diversity (Salimath et al., 1995). The knowledge of genetic diversity estimated by 
molecular markers is essential to select genetically distinct germplasm for hybridization programs. In the absence 
of finger millet genome sequence, attempts have been made to develop microsatellite markers and single nucleotide 
polymorphism (SNPs) (Gimode et al., 2016). Markers have been employed to study disease resistance, calcium 


www.tiprc.org 


editor@tjprc. org 


Original Article 



278 


Nagabhushana Kotrappa, Shailaja Hittalmani & Krishna V enkatarangaiah 


dynamics, and in association mapping of various agronomic and nutritional traits (Yadav, et al., 2014b; Babu et al., 2014a, 
c& d). However, like many orphan crops, finger millet lacks genomic resources sufficient for its genetic improvement. 
Microsatellites (also called as simple sequence repeats: SSRs) are still the cost-effective and robust markers that offer high 
reproducibility and polymorphism. Currently, there are < 300 informative microsatellites in finger millet and a couple of 
SNP studies (Sood et al., 2016). While the development of these markers in the native species is expensive, designing 
EST-derived markers using the public sequences become cost effective and will add to the limited genomic resources 
available in this species. Besides wild cultivars that serve as gene donors, the local cultivars and landraces with different 
attributes offer enormous genetic variability and could be used in varietal improvement program. In this study, the publicly 
available genomic and expressed sequence tag (EST) sequences reported for various traits, including NBS-LRR regions, 
disease resistance and abiotic stress tolerance from nine Eleusine species were collected and screened for tandem repeats, 
in order to develop functionally validated EST-derived microsatellites and were employed in studying local cultivars and 
landraces of finger millet. 

MATERIAL AND METHODS 

Genomic DNA was isolated from 32 genotypes collected from five regions within Karnataka, ranging from the 
southern dry region to western Malnad region (Table 1) using a CTAB method (Doyle and Doyle et al., 1990). A set of 
3515 publicly available genomic and EST sequences was obtained from NCBI database and the sequences with >80 % ‘N’ 
and <50 bp were trimmed off to get high quality ESTs for mining repeat motifs. Repeat prediction was performed using 
MISA (Thiel et al., 2003) with a moderately stringent parameters of 5 repeats for di-nucleotide repeats and 3 repeats for tri- 
nucleotide repeats and 2 repeats for tetra-nucleotide repeats besides compound repeats. Primers were designed in SSR 
containing sequences using the Primer3 tool (Untergrasser et al., 2012) with the optimum standards of 50% GC content, 
melting temperature of 58°C, primer length of 20 bases and the PCR product size ranging from 150 to 300 bases. The 
annotation of the primers was carried out and grouped under the functional categories of molecular function, biological 
process and cellular component using UniProt database. 


Table 1: List of Finger Millet Genotypes Collected from 5 Different Districts in Karnataka 


Location 

Local Landraces 

Commercial Cultivars 

Chintamani 

Nalla, Mooruthingala, Gidda, Thella, Indaf 

Indaf-5 

Pavagada 

Kappu, Kunte, Bili, Dappa 

- 

Shimoga 

Karimundaga, Makari, Mooruthingala, Kapputhene, Bili, Kari kaddi, Malali, 
Dodda, Sannakaddi, Jainumandaga, Thaidala, Kolimade, Bonda and Jagaluru 

- 

Mandya 

- 

MR-1, MR-6, KMR-204 
and KMR-301 

Bengaluru 

- 

ML-322, ML-31, ML- 
365 and Indaf-7 


Polymerase chain reaction (PCR) was performed by the components of template DNA (20 ng), primers (5pM), 
10X PCR buffer. dNTPs (1 pM), MgCl 2 (1.5mM) and Amplitaq Gold Taq DNA polymerase (0.3 U) in a reaction volume 
of 20 pL. The PCR profile with the step-down annealing conditions (T A ) was performed with the initial denaturing of 
template DNA at 94°C for 5 min, followed by 35 cycles of 94°C for 60 Sec, T A for 30 Sec and 72°C for 60 Sec, followed 
by a final extension at 72°C for 8 min. The amplicons were resolved on 2 % high resolution agarose along with the size 
standards and the allelic data was extracted based on the size. The marker data were analyzed for genetic diversity 
parameters such as expected heterozygosity ( H e ) (~ Diversity Index: DI), Polymorphism Information content (PIC) and 
Hardy-Weinberg equilibrium test using CERVUS 3.0 (Kalinowskiet al. 2007). Genalex (Peakall and Smouse, 2012) and 
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STRUCTURE V2 (Pritchard et al., 2000a) were employed for estimating, analysis of molecular variance (AMOVA), 
principal co-ordinate analysis (PCoA) and population structure respectively. 

RESULTS AND DISCUSSIONS 

The microsatellite analysis of 3515 genomic and EST sequences indicated the presence of tandem repeats in 2602 
sequences, resulting in the detection of 4530 SSRs. The higher number of SSRs was possible due to the presence of 
multiple repeat motifs in 1789 sequences. A total of 3334 primers were designed for DI, Tri, tetra and pentarepeat motifs 
along with the compound formats. The repeat prediction using MISA indicated 16%, 42%, 3% and 39% of di- tri-, tetra- 
and Penta-base repeats respectively. Unlike the current reports on prevalence of more di-base repeats, our study indicates 
more of tri-base repeats, which could be due to the inclusion of repeat-enriched nucleotide sequences in SSR discovery. 
Normally, the presence of Penta-repeats is less in finger millet (Babu et al., 2014b), the high numbers in our study is 
attributed to less rigor in its prediction. Among di- and tri-nucleotide repeats identified, AG/CT, CGC & AAG repeats 
were most prevalent, which was in accordance with earlier reports (Sood et al., 2016). 

One hundred informative primers were synthesized and standardized in E. coracana species. Although 88 primers 
showed amplification, only 60 primers showed clear and scorable alleles. From this 60 primers, a set of 38 primers was 
selected for genotyping 23 landraces and 9 cultivated varieties (Table 1). Twenty eight primers were polymorphic that 
amplified a total of 91 alleles with the mean values of 4.6, 0.5 and 0.43corresponding to the number of alleles per locus, 
diversity index and PIC (Table 2). Earlier studies in finger millet indicate the average polymorphisms of 72 % has been 
reported while characterizing 64 Indians and 61 non-Indian finger millet genotypes using 87 genomic SSRs (Ramakrishnan 
et al., 2016). While EST-derived SSRs exhibit low polymorphism, our study shows up to 55 % polymorphism, which is 
higher than the recently reported 48 % in finger millet (Babu et al., 2014b). Majority of the primers were multi-allelic that 
an amplified maximum of up to 12 alleles (Primer ECML036), with an average of 4.6 alleles per locus. Seven primers 
amplified two loci each per locus which is predictable as the cultivated E. coracana is a tetraploid species derived from 
two genomes (Goron and Raizada, 2015). A goodness-of-fit test for HWE indicated 7 loci deviating significantly from 
expectations at P < 0.001 (Bonferroni correction) as the population size was small which could be improved by additional 
tests with large populations of unrelated individuals. The gene ontology (GO) terms retrieved from Uniprot database 
classified the functions into 6 major groups in the categories of molecular function, biological process and cellular 
component. Majority the sequences were homologous to the genes responsible for the enzyme catalytic activity that are 
involved in biotic and abiotic stress tolerance (Figure 1). Similarly, the functional annotation of finger millet sequences 
classified 50 % of the genes to catalytic activity in GO terms (Gimode et al., 2016). Functional annotation and 
classification of markers becomes essential as functional SSRs could be helpful in comparative mapping and post 
transcriptional gene silencing (Nagabhushana et al., 2017). 
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Table 2: Details of Twenty Microsatellite Primers Used for Characterization 
of 23 Landraces and 9 Varieties of Finger Millet 


Primer ID 

GenBank ID 

Repeat 

Forward Primer(5'-3') 

Reverse Primer (5'-3') 

Ta 

(°C) 

Size 

(bp) 

k 

He 

PIC 

ECML001 

EU439937.1 

(CGC)3* 

CTGGATCATGCACGAGTACC 

CTTCCTCCTCCT1TGCAGTG 

58 

180 

6 

0.57 

0.46 

ECML004 

BV723953.1 

(TG)6ttg 

IGCATGCAAGCTTTCCCTAT 

ACACCACCAACCCACACA 

58 

260 

8 

0.83 

0.79 

ECML005 

BV723953.1 

(GCGT)4 

CTAGTAGCAGAATCACGCCC 

TGTCATCATCACTCGCATGG 

58 

223 

3 

0.40 

0.33 

ECML006 

BV723951.1 

(CAA)3cac 

CCTCCCTAGCAGAATCAGGT 

TCAGCCTTATACACCGTTGC 

58 

164 

5 

0.57 

0.49 

ECML007 

BV723950.1 

(AC)1I(AG)23 

TTGCTTCTGATGCATCTCCC 

CCCTGACACAGAAACAGGAC 

58 

257 

6 

0.52 

0.45 

ECML010 

BV723947.1 

(cxyiogt 

GGATCCAAACACCCCAICAA 

TTGTCGTGGTAAGCTTTGCT 

58 

173 

9 

0.64 

0.57 

ECML011 

BV723946.1 

(TTG)5tB(GTD7gs 

CTTTGGTGCTGGTGTAGAGG 

TTCATCTCAGCAGCAGCAAA 

58 

274 

5 

0.72 

0.67 

ECML015 

BV723943.1 

(TTG)4ctg 

GAGGATGACTCCCATTGCTG 

CTCTCGCCCACCATAGTAGA 

58 

209 

4 

0.59 

0.51 

ECML021 

BV723932.1 

(TA)5att 

GCGTAAGTCTTGCGGAGTAT 

C C CTC AC CC TCCTATGGTAG 

58 

277 

2 

0.45 

035 

ECML024 

GF107744.1 

(GAT)7 

AGGTGACCTCAAGACCAAGA 

TGGATCCACAGTTGAAGCAC 

58 

282 

4 

0.60 

0.53 

ECML029 

GF107728.1 

(AG)Sgat 

GCCGCGAGCACTAATTTTEA 

GGCAAAAGATACTCTCTCTCCC 

57 

286 

2 

022 

020 

ECML036 

GF107710.1 

(GT)29 

AAAGGTIGTGAGGGAGGAGA 

CAGAACAACAGGACCCACAA 

58 

297 

12 

0.84 

0.81 

ECML051 

JN37885 1.1 

(GCC)3atc 

ATTCGGTTCTGACGATGCAA 

GTGTTTGTCAGCCTGGAGAA 

58 

275 

2 

022 

020 

ECML052 

KC894S19.1 

(GT)37 

GCATATGTGTTGATGTCGGC 

CCCCAAGCTTCGGATCAAC 

58 

178 

2 

022 

020 

ECML071 

GF107753.1 

(GA)21 

CTCGCTAGCAGAATCACAGG 

CTGCCACGAGGATAGATGC 

58 

255 

3 

0.67 

0.58 

ECML078 

EB1S7460.1 

(AGC)7 

GCCAAAGGAAAACTTGAGGC 

AGGCC GAGAGAAAGAGAGAG 

58 

211 

4 

0.50 

037 

ECML079 

EB187410.1 

(TTC)8 

CGGCATGCCATCATTTTGTT 

ACGCCAACAATATCGAACCA 

58 

165 

4 

0.51 

037 

ECMLOSO 

EB086243.1 

(TC)15 

GGGACTCTAGTTTCCGC1TT 

GTTTGGGGAAGAAGCTCCAA 

57 

235 

2 

026 

022 

ECM1086 

AF144580.1 

(GAA)3atg 

CTGGAACCTTTCTCCAACGA 

ACCCCAGAAAATCGAAGCAA 

57 

285 

2 

022 

020 

ECML091 

AY523508.1 

(TCAAT)2tct 

GGGGGAGAGAAAAAC TTTT GG 

ACAATCAAGTCCGTAGCGTC 

57 

264 

6 

0.46 

039 

Mean 







4.6 

0.50 

0.43 


Note: Ta Annealing temperature; k-Number of alleles amplified; //^-Expected heterozygosity; PIC -Polymorphic Information Content. The He values 
in bold numbers indicate markers deviating from Hardy- Weinberg equilibrium (p=0.001) 
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Figure 1: Gene Ontology Based Classification of Markers under Different Functional Categories Viz., 
Molecular Function, Biological Process and Cellular Component. The Numbers 
Embedded in the Graphs Indicate Percent Distribution under Each Group 


The AMOVA results indicated the prevalence of significant variation within (80%) and among (20%) 
populations, indicating the scope for exploiting the genetic diversity for further selection and hybridization programs. 
Similar results of high variation within populations were reported by Ramakrishnan et al., (2016) and Babu et al., (2014b). 
The PCoA analysis drawn from 1 st and 2 nd coordinates based on highest Eigen values indicated wide variation spread 
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across the genotypes from Shimoga (Malnad region), whereas the individuals from other locations were more or less 
clustered together in 4 different groups (Figure 2). As the millets are grown in remote regions, significant diversity persists 
in the local landmarks that are valued for a wide variety of traits. 



♦ CHI 
■ PAV 
A SHI 
xMAN 
xBEN 


Figure 2: Principal Co-Ordinate Scatter Diagram of 23 Landraces and 9 Varieties of Finger Millet Collected from 
5 Locations (CHI: Chintamani, PAV: Pavagada, SHI: Shimoga, MAN: Mandya BEN, Bengaluru). 

The Numbers in the Diagram Indicate Different Clusters Based on Respective Locations 


The population structure analysis was performed with five independent runs of K = 1 to 10 at 5000 Markov chain 
Monte Carlo (MCMC) repetitions and 50000 burn-in period, using prior population information with the admixture model 
assuming correlated allele frequencies. Furthermore, AK values were estimated based on the rate of change in the log 
probability of data between successive K values and Evanno’s correction (Evanno et al., 2005) showed two peaks of AK, 
for K = 1 & 4, suggesting the presence of four subgroups within a major group (Figure 3 & 4). Except few admixtures, the 
grouping of individuals under four subgroups followed their geographical location and these results were in 
correspondence with PCoA results. The individuals from Bengaluru and Mandya were genetically closer than the rest of 
the individuals, as they were all developed varieties. With the limited scope for hybridization between wild species and 
cultivated varieties, the landraces with ample of diversity offer an excellent source of genetic diversity in crop development 
(Goron and Raizada, 2015). Although genomic microsatellite markers are known for their abundance in the genome and 
high polymorphism, EST-derived microsatellites are considered more informative due to their association with functional 
genes and transferability across related species and genera. The association of functionally derived SSR polymorphisms 
with actual traits have been demonstrated in finger millet aiding in cost-effective marker assisted selection programs for 
crop improvement. 
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Figure 3: Population Structure of 23 Landraces and 9 Varieties of Finger Millet. The Different Colors 
Indicate Four Subgroups with Admixture Alleles among Genotypes from 5 Locations Viz., 
C: Chintamani, P: Pavagada, S: Shimoga, M: Mandya, B: Bengaluru 




Figure 4: Structure Estimation of the Number of Populations for K Ranging from 
1 To 10 by AK Values, the Value of K was Significant At K = 4 


CONCLUSIONS 

Even with the recent development of increased genomic resources in finger millet, wide phenotypic diversity is 
not reflected at genetic level due to low genetic polymorphism, which has posed a major challenge in molecular mapping 
approaches. Unlike in major crops, the availability of informative genomic tools and its subsequent utility in 
characterization of wild cultivars and landraces is still a major concern in finger millet, which requires the attention of 
researchers from India and Africa. From this perspective, the current study highlights large scale development of EST- 
derived SSRs and their application in characterizing local landmarks. Therefore, the narrow genetic diversity existing 
within cultivated finger millet can be addressed by utilizing the local landmarks that hold considerable genetic diversity. 
These markers can be successfully utilized in genetic characterization; population studies, genome mapping and marker 
assisted breeding programs. 
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